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Persistent Homology Lower Bounds on 
High Order Network Distanees 

Weiyu Huang and Alejandro Ribeiro 


Abstract —High order networks are weighted hypergraphs col¬ 
lecting relationships between elements of tuples, not necessarily 
pairs. Valid metric distances between high order networks have 
been defined but they are difficult to compute when the number 
of nodes is large. The goal here is to find tractable approximations 
of these network distances. The paper does so by mapping high 
order networks to filtrations of simplicial complexes and showing 
that the distance between networks can be lower bounded by the 
difference between the homological features of their respective 
nitrations. Practical implications are explored by classifying 
weighted pairwise networks constructed from different gener¬ 
ative processes and by comparing the coauthorship networks of 
engineering and mathematics academic journals. The persistent 
homology methods succeed in identifying different generative 
models, in discriminating engineering and mathematics commu¬ 
nities, as well as in differentiating engineering communities with 
different research interests. 

Index Terms —Network theory, networked data, high order 
networks, computational topology, pattern recognition 

I. Introduction 

High order networks describe relationships between ele¬ 
ments of tuples that go beyond the pairwise relationship 
that are more often considered a, II. High order networks 
are important in applications in which relationships between 
triplets, quadruplets, and generic n-tuples are important. For 
example, coauthorship networks can be constructed by de¬ 
tailing the number of joint publications by pairs of scholars, 
but adding information on publications of individual authors 
and joint publications by specific triplets of authors generates 
a more accurate description. The relevance of expressing 
tuple relationships other than pairwise has been noted in 
various domains, including sensor networks 0, 0, cognitive 
learning Q, wireless networking IS), image ranking 0, object 
recognition m, and social networks HD. 

The goal of this paper is to develop efficient methods to 
quantify differences between high order networks of possibly 
different sizes. Ideally, differences should be quantified using 
distances derived from an underlying metric in the space of 
high order networks. However, while distances are not difficult 
to define, they can be intractable to compute. This difficulty 
has motivated the use of feature comparisons in which the 
difference between specific properties of the network is used 
as a more manageable alternative. Examples of feature com¬ 
parisons are clustering coefficients 02, neighborhood topolo¬ 
gies m, betweenness m, motifs iia, wavelets IThll . and 
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graphlet degree distributions CD, im. While computationally 
simpler, the use of features is application dependent, utilizes 
only a small portion of the information conveyed by the 
networks, and may yield conflicting comparative judgements 
because the triangle inequality is not necessarily valid. A 
proper distance between pairwise and high order networks 
overcomes these drawbacks. This is the motivation for the 
definition of network metrics that generalize the Gromov- 
Hausdorff distance between metric spaces HD, GqI to pair¬ 
wise ED and high order networks Il22l . 

The metric distances between high order networks defined 
in II 22 II have been applied to compare networks with small 
number of nodes and have succeeded in identifying collabora¬ 
tion patterns of coauthorship networks. However, because they 
have to consider all possible node correspondences (Definition 
[T]i, network distances are difficult to compute when the number 
of nodes in the networks is large. The goal of this paper is to 
develop network discrimination methods that are computable 
in networks with large numbers of nodes. These discrimination 
methods are constructed by drawing a parallel between high 
order networks and algebraic topology filtrations. Homological 
features of filtrations are then used to compare high order 
networks and shown to provide lower bounds for the actual 
network distances. Although distance lower bounds suffer 
from some of the same problems associated with feature com¬ 
parisons, they nonetheless have important properties. Among 
them, we know that a large lower bound entails a large distance 
and that we can use lower bounds to estimate distance intervals 
because upper bounds are easy to determine. 

The paper begins with a review of high order networks 
and high order network distances (Section HI] and ED). We 
then introduce concepts from algebraic topology ED, E4ll and 
discuss the representation of high order networks in terms of 
filtrations (Section HIHi . Specifically, we revisit the definition 
of a homological feature as a formalization of the notion of 
a hole without an interior in an unweighted network and the 
definition of a filtration as a nested collection of unweighted 
networks indexed by a time parameter. The most important 
observation in this preliminary discussion is that a class of 
high order networks can be represented as a filtration that starts 
empty and to which nodes, edges, and high order tuples are 
added at a time given by their weight (Section flll-Ab . We then 
introduce persistence diagrams to monitor the appearance and 
disappearance of homological features from the unweighted 
networks that compose a filtration ('Section llll-Bb . 

We propose to compare networks by considering their 
respective filtrations and using the difference between the 
persistence diagrams of the respective filtrations as a proxy 
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for their distance. (Section II3- This proposed methodology 
is substantiated by the fact that we can lower bound the 
computationally intractable distance between two high order 
networks with a tractable distance between their respective 
persistence diagrams (Theorems [T] and |2] in Section lTVl i. These 
lower bounds are tight such that there are examples in which 
the lower bound and the actual distances coincide. Since 
persistent homologies can be computed efficiently for large 
networks (Section |VT] and ||25]| . Il26l l. we can use these lower 
bounds in, e.g., network classification problems. We do so 
for artificial networks created with different models - random 
networks, Gaussian kernel proximity networks, and Euclidean 
feature networks - and for coauthorship networks constructed 
from the publications of a number of journals from engineering 
and mathematics communities. The proposed methods succeed 
in distinguishing networks with different generative models 
(Section [VII-Al l and are also effective in discriminating engi¬ 
neering journals from mathematics journals (Section IVII-BI) . 
We also attempt a more challenging classihcation problem of 
three different engineering communities where we achieve a 
moderate success (Section IVlI-Cb . We close with concluding 
remarks (Section IVlllb . 

11. High Order Networks 

A network of order K over the node set X is 

defined as a collection of AT + 1 normalized relationship 
functions {r^ ■ [0, l]}^o from the space 

of (fc + l)-tuples to the closed unit interval [0,1]. For tuples 
xo-,k ■= xo,xi,... ,Xk, the values r’^ixo-.k) of their fc-order 
relationship functions are intended to represent a measure 
of similarity or dissimilarity for members of the group. We 
restrict attention to symmetric networks defined as those in 
which the relationship function depends on the elements of 
the tuple but not on ordering nor repetition. This implies, 
e.g., that r'j^{x,x,x') = r]^{x,x') because x is repeated 
in the tuple x,x,x' and that r\{x,x',x") = r\{x',x,x") 
because the tuples x^x',x" and x',x,x'' contain the same 
elements in different order. Observe that the network 
can be considered as a weighted complete hypergraph IZTl 
whose weights for all possible hyperedges are defined. Since 
relationships are not explicitly defined for all hyperedges 
in many cases, we adopt the convention that the undefined 
hyperedge relationships take the value 0 or 1, depending on 
whether the relationships encode dissimilarities or similarities 
(see Section III-Al and Section|V]i. The value of K is no greater 
than 2 in most applications. The set of all high order networks 
of order K is denoted as 

Two networks and are said fc-isomorphic if there 
exists a bijection tt : X —>■ Y such that for all XQ-,k € X^^^ 
we have 

TylTrjxo:*)) = r^(xO:fc), (1) 

where we use the shorthand notation r^(7r(a:o:fc)) := 

rY{TT{xo),Tr{xi),... ,TT{xk))- Since the map tt is bijective, 
o can only be satisfied when is a permutation of Y. 
When networks and are fc-isomorphic we write 
=k Ny ■ The space of AT-order networks where k- 
isomorphic networks are represented by the same element 


is termed the set of AT-networks modulo /c-isomorphism and 
denoted by mod =k- For each 0 < k < K, the space 
mod ^k can be endowed with a pseudometric ll2^ . 
The definition of this distance requires introducing the notion 
of correspondence ll28l Def. 7.3.17]: 

Definition 1 A correspondence between two sets X and Y is 
a subset C ^ X x Y such that \/ x € X, there exists y G Y 
such that {x, y) G C and \/ y G Y there exists x G X such 
that {x,y) G C. The set of all correspondences is denoted as 
C{X,Y). 

A correspondence in the sense of Definition[T]connects node 
sets X and Y so that every element of each set has at least one 
correspondent in the other set. We can now define the distance 
between two networks by selecting the correspondence that 
makes them the most similar as we formally define next. 

Definition 2 Given networks and Ny , a correspondence 
C between the node sets X and Y, and an integer 0 < k < K 
define the k-order network difference with respect to C as 

'^'xy{C)-=, max \r\{xQ..k) - rY{yo,k)\, (2) 

(a^O;fc,yO;fe)^C' 

where (xo:fc,j/o:fe) stands for (a:o,t/o), {xi,yi), ■ ■ ■ ,{xk,yk)- 
The k-order network distance between and Ny is defined 
as 

d^N^, iVf) := {r^,y(C)} . (3) 

For a given correspondence C G C{X,Y) the network 
difference F^ y (C) selects the maximum distance difference 
— ryivo-.k)] among all pairs of correspondents - we 
compare r\{xw.k) with ry^yo-k) when all the points xi G x^-k 
and yi G yo-.k are correspondents. The distance in 0 is defined 
by selecting the correspondence that minimizes these maximal 
differences. The distance in Definition |2] is a pseudometric 
in mod =k CD. Observe that since correspondences 
may be between networks with different number of elements, 
d^{N^, Ny ) is defined when the node cardinalities |2f| and 
|F| are different. 

The distances in 0 compare relationship functions of a 
given order k. To compare all orders 0 < k < K we begin by 
defining two AT-order networks and Xp as isomorphic 
if there exists a bijection tt : X ^ Y such that O holds 
for all 0 < fc < AT and xo-,k G When networks N^ 

and NP are isomorphic we write XP = Xp. The space 
of AT-order networks modulo isomorphism is denoted as 
mod =. A family of pseudometrics measuring the difference 
over all order functions can be endowed to mod = as 
we formally state next. 

Definition 3 Given networks Xp and Xp, a correspondence 
C between the node sets X and Y, and some vector p-nonn 
II • lip, define the network difference with respect to C as 

(rx,Y(C), (c), • ■ ■, rj^,y(C')) , 

p 

(4) 
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Fig. 1. Temporal dynamics for the formation of a research community. The 
k-order relationship in this 2-order dissimilarity network [cf. Definition [4) 
denotes the normalized time instant when members of a (/c + l)-tuple write 
their first joint paper. E.g., A writes her first paper at time 0, and coauthors 
with B, D, C at times 2/9, 4/9, 5/9, respectively. The e in the relationship 
function is a technical modification to differentiate relationship between two 
authors from that of a single author. Author A also writes jointly with B and 
D at time 4/9. 

where y (C') is the k-order difference as per C defined in 
(01. The p-nonn network distance between and Ny is 
defined as 

dN',p{Nx jNy) ■= I ||r;^_y((7)||^| . (5) 

In Definition 0 we compare relationship functions of all 
orders no larger than K. The norm ||r^y(C')||p is assigned 
as the difference between and Ny measured by the 
correspondence C. The distance d^^p{N^,Ny) is then the 
minimum of these differences achieved by some correspon¬ 
dence. 

The fc-order function of a network does not impose 
constraints on the Z-order function of the same network. In 
practical situations, however, it is common that adding nodes 
to a tuple results in increasing or decreasing relationships 
for the extended tuple. This motivates dissimilarity networks, 
which we do next, and proximity networks, which we discuss 
in Section FV] 

A. Dissimilarity Networks 

In dissimilarity networks, r^{xQ-.k) encodes a level of 
dissimilarity between elements. In this scenario it is reason¬ 
able that adding elements to a tuple makes the group more 
dissimilar and results in a higher value in the relationship 
function. This restriction makes up the formal definition that 
we introduce next. 

Definition 4 We say that the K-order network = 

(X, ..., is a dissimilarity network if the order in¬ 

creasing property holds, i.e. for any order 1 < k < K and 
tuples Xo:k € we have 

Ai^o-.k) > rA^{xo-.k-i), ( 6 ) 

and the inequality dU equalizes if and only if all the ele¬ 
ments in xw.k <^lso appear in xw.k-i- Denote the set of all 
dissimilarity networks of order K as D^. 

As per Dehnition|4]dissimilarities A{xo:k) grow as we add 
elements to a tuple. Notice that due to symmetry, a relationship 
as in (01 holds if we remove an arbitrary node from the 
tuple xo:k, not necessarily the last. We further emphasize that 
Definition 0 includes the condition of having the inequality in 
(0 be strict unless all the elements of XQ-,k appear in xo-,k-i. 
This implies that to have r^{xQ-.k) = {xw.k-i) it must 


be that at least one element of xw.k is repeated in the tuple. 
Conversely, when we add a distinct element to a tuple, the dis¬ 
similarity must increase, i.e, r^{xw.k-i,Xk) > rA^{xw.k-i) 
if Xk is not a repetition of an element of Xo:k-i- This condition 
is technical and of little consequence for the results presented 
later - see Remark 0 

To see why the order increasing property is reasonable, 
consider a network describing the temporal dynamics of the 
formation of a research community - see Figure 0 The 
relationship function in this network marks the normalized 
time instant at which members of a given (fc -|- 1)-tuple write 
their hrst joint paper. For fc = 0, the zeroth order dissimilarities 
are the normalized time instants when authors publish their 
hrst paper. In Figure 0 authors A, B, C, and D publish their 
hrst papers at times 0, 1/9, 5/9, and 3/9. The hrst order 
dissimilarities between pairs denote times at which nodes 
become coauthors. Since a coauthored paper is also a paper 
by each of the authors, it is certain that r\-(x,x') > r^{x). 
In Figure 0 A and B become coauthors at time 2/9, which 
occurs after they publish their respective hrst papers at times 
0 and 1/9. Authors A and D become coauthors at time 
4/9 and A and C become coauthors at time 5/9. Second 
order dissimilarities r\ denote the time at which a paper is 
coauthored by a triplet. Since a paper can’t be coauthored 
by three people without being at the same time coauthored 
by each of the three possible pairs of authors we must have 
r\{x,x',x") > r^{x,x'). In Figure 0 authors A, B, and 
D publish a joint paper at time 4/9, which is no smaller 
than the pairwise coauthorship times between each two of the 
individual authors. Undehned hyperedge relationships in dis¬ 
similarity networks take the value 1 indicating the maximum 
dissimilarities for the tuple. An additive constant e is used to 
make r\{A,C) strictly larger than r^(C') and r\{A,B,D) 
strictly larger than r\r{A,D) and r^{B,D). This is needed 
because of the requirement in Definition 0 that the equality in 
0 be achieved only if the elements in xo-,k are all elements 
of X0:k-1- 

When specialized to dissimilarity networks, the network 
distances in dehnitions 0 and 0 are termed the fc-order dis¬ 
similarity network distance and the p-norm dissimilarity 
network distance d-p^p- The restriction to dissimilarities makes 
the distances well-defined metrics in their respective spaces 
Ea. Although these metrics provide well-founded methods 
to compare high order networks, the search for the optimal 
correspondences in 0 and 0 is combinatorial. This makes 
it impossible to evaluate distances when the number of nodes 
in the networks is large and motivates the development of 
tractable lower bounds. These bounds are obtained by relating 
dissimilarity networks to hltrations of simplicial complexes 
lED, II 24 I as we explain next. 

III. Networks and Simplicial Complexes 

We introduce elemental notions of computational topology 
as they apply to the study of high order networks. Begin by 
dehning a fc-simplex = [xo:fc] as the convex hull of the set 
of points xo-,k (see Figure 0 and a simplicial complex L as 
the collection of simplices such that for any simplex [xo:fe], the 
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vertex [a] 
0-simplex 



Fig. 2. Elementary /c-simplices for 0 < fc < 3. 


convex hull of any subset of x^-k also belongs to L. Figured 
(a) exemplifies two triangles connected together as a simplicial 
complex of dimension 2. It is a simplicial complex because 
for any of its simplices, say, [a,b,c], the convex hulls of the 
subsets of {a, b, c} - which include the points [a], [6], and [c] 
as well as the edges [a, 5], [b, c], and [a, c] - all belong to the 
simplicial complex as well. 

An important concept in simplicial complexes is that of a 
hole without interior. For the simplicial complex in Figure [3 
(a), the area enclosed by [b,d], [d,c], [c,b] is a hole without 
interior because the area is not filled. The area enclosed by 
[a, 6], [6, d], [d, a] is not because the interior is filled by the 
2-simplex [a,b,d]. Homologies are defined to formalize this 
intuition and rely on the definitions of chains, cycles, and 
boundaries. The k-chain fdicpi is a summation of k- 

simplices (j)i modulated by coefficients fii whose signs denote 
orientation. This definition is a generalization of the familiar 
definition of chains in graphs. E.g., in Figured [a, b] + [b, d] 
is a 1-chain that we can equivalently represent as [a, 6] — [d, b]. 
Further consider a given fc-simplex (p = [xoifc] and its border 
{k — l)-simplices defined as the ordered set of elemenets 
[^oTfc] “ conv{xo:fc\a;j}, in which each of the elements is 
removed in order. The boundary dk4> of the simplex (p is the 
chain formed by its borders using alternating orientations, 

k 

9kp=Y.(-^yKT-.k\- (7) 

1=0 

According to the boundary dkp of a fc-simplex is the 
collection of (fc — l)-simplices. For the simplices in Figured 
the boundaries are 9o[a] = 0,cli[a, 6] = [6] — [a],d 2 [a,b,c] = 
[6, c] — [a, c] + [a, b] and 83 [a, b, c, d] = [6, c, d] — [a, c, d] + 
[a, b, d] — [a, b, c]. 

Having defined chains and boundaries we consider a fc-chain 
<i)fe = define the chain boundary dk^k as the 

summation of the boundaries of its component simplices, 

dk^k =^ 8 ^i^kpi)■ (8) 

i 

As per this definition, the boundary of the chain $ = [a,b] + 
[6, d] in Figure [3 is = [a] — [5] + [6] — [d] = [a] — [d], which 
matches our intuition of what the chain’s boundary should be. 
We can now formally define a k-cycle as a chain whose 
boundary is null, i.e., a fc-chain for which dk'^ = 0. In Figure 
[3 the chain T' = [a, 5] -I- [b, d] + [d, a] is a cycle because its 
boundary is dk^ = [a] — [b] + [b] — [d] + [d] — [a] = 0. 

A fc-cycle is a k-homological feature if it cannot be 
represented as the boundary of a (fc-l-l)-chain $fc+i; otherwise 
T'fc is a k-boundary. The group T-Lk{L) of fc-homological 



Fig. 3. (a) An example of simplicial complex L which consists of four 0- 

simplices, five 1-simplices, and one 2-simplex. (b) A dissimilarity network can 
be represented as a simplicial complexes with weights. When the weight of a 
simplex denotes the time instant the simplex appears in the nested sequence 
of simplicial complexes, it yields a valid filtration C. 



Fig. 4. More examples of simplicial complexes and homologies. The number 
of 2-homological features are described below each of the simplicial complex. 


features of a simplicial complex L represents all fc-cycles that 
are not fc-boundaries. I.e., a homological feature appears when 
a fc-cycle is not among the group of boundaries dk+i^k+i 
of (fc -f l)-chains. 

For the simplicial complex in Figure [3 (a), the chain 
T*! = [a, 6] -I- [6, d] -I- [d, a] is a 1-cycle because 9iT'i = 0; 
simultaneously, T*! is also a 1-boundary because 92[a, 6, d] = 
[6, d] — [a, d] -f [a, b], which is identical to T*!. Therefore, 
does not represent a homological feature. On the other hand, 
the chain = [6, c] + [c, d] + [d, b] describes a homological 
feature because it is a cycle with = 0, and not a 

boundary due to the fact that the 2-simplex [6, c, d] is absent 
from the complex. 

Other illustrative examples are shown in Figure 0] The 
complex (a) has one 1-homological feature corresponding to 
the chain [a, b] + [6, c] + [c, a] and the complex (b) has 1 
feature as well represented by [a, 6] -f [6, c] -f [c, d] -f [d, a]. 
Complex (c) has 2 features that are represented by chains 
[a, b\ -\- [6, d] -I- [d, a] and [6, c] -f [c, d] + [d, b] but complex 
(d) has one feature because the chain [a, b] + [6, d] -f [d, a] is 
the boundary of the simplex [a, 6, d]. 


A. Representation of High Order Networks as Filtrations 

In the same manner in which a graph represents an un¬ 
weighted network, a simplicial complex represents a high 
order unweighted network. To represent weighted high order 
networks we assign a weight to each simplex, but instead 
of thinking the network as a weighted simplicial complex 
we think of weights as parameters that indicates the time 
at which the simplex comes into existence. Formally, for 
parameters a € [0,1] we define a filtration £ as a collection of 
simplicial complexes such that for any ordered sequence 
0 = ao < «! < ■ • ■ < ttm = 1 it holds 0 = C L^ti ^ 

... C La^ = L. The minimum time a at which a simplex 
becomes an element of La is the birth time of the simplex. 

Given a dissimilarity network D^, we construct a filtration 
C{D^) by assigning the appearing time of simplex [xo:fc] as 
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j_I_Lj:i_i_L 

filtration 


yy yyy 


Fig. 5. Top: filtration of nested simplicial complexes of the dissimilarity 
network exhibited in Figure [3] (b). The simplicial complex at each time instant 
are the collections of all vertices appearing before or on that time. Bottom: 
1-persistence diagrams describing 1-persistent homologies for each of the 
simplicial complexes detailed in Top. The horizontal axis denotes the birth 
time of homological features and the vertical axis represents the death time. 





0 0.5 0 0.5 0 0.5 0 0.50.7 


(a) (b) (c) (d) 

Fig. 6. The 1-persistence diagrams for the filtration induced by each of the 
dissimilarity networks. The maximum death time of any homological features 
is 1 because undefined hyperedges take value 1 implicitly. 


the relationship r^{xQ-k) of the corresponding tuple, 

[xo:k] e La <5=^ rxixo-.k) < a. (9) 

We show next that ® defines a valid filtration. 

Proposition 1 The filtration C{Dx) with elements La in¬ 
duced from a given dissimilarity network through the 
relationship in dUl is a well defined filtration. 

Proof: For a given a, (l9]l defines a set of complexes La- 
It is clear that the collection of sets La is nested, i.e., that 
0 = Lao Q ■ ■ ■ ^ La^ = L holds for any set of birth times 
0 = ao < ■ • ■ < ctm = 1. To prove the statement we need 
to show that the set of complexes La is a valid simplicial 
complex. To show this it suffices to verify that for any a, 
all faces of each simplex in La also appear no later than a. 
Suppose simplex [xo:fc] appears before time a for some chosen 
a, then it must be true that r^{x[)-k) < a. For any faces of 
[a:o:fc]. say with k < k, the order increasing property 

implies r\{xQ.j.) < r^{x[)-k). Therefore, 

rxi^o-.k) ^ r^xi^o-.k) < a, (10) 

which shows that the face appears before time or on 

time a. This means that La is a valid simplicial complex. □ 

We emphasize that filtrations can’t be defined for an ar¬ 
bitrary high order networks because the order increasing 
property is necessary for the proof. For dissimilarity networks 
that do satisfy the order increasing property. Proposition [T] 
establishes the existence of an equivalent filtration. 


Figure |5] (top) shows the construction of the filtration 
associated with the dissimilarity network in Figure [3 (b). At 
time a = 0, there are four nodes because r^ia) = r^ib) = 
'^xi'^) ~ = 0. At time a = 0.1, the edge [a, 6] starts 

to appear in the simplicial complex because the relationship 
rx{a,b) = 0.1. As time gradually increases, more simplices 
get included in the simplicial complex at each time instant. 
At the end of the filtration with a = 1, all simplices in the 
original simplicial complex are involved. 


B. Persistent Homologies and Persistence Diagrams 

As time advances in a filtration, holes and interiors appear 
and disappear. Persistent homologies examine when these 
homological features appear for the sequence of simplicial 
complexes in the filtration. Formally, given a filtration C, 
its fc-dimensional persistence diagram VkL is a collection 
of points of the form q = [(?b, qd] where qi, and qd > qb 
represent the birth and death time of a homological feature. 
I.e., the times qb represent resolutions a = qb which 
a feature is added to the group of fc-homological features 
T~lk{La) = T~Lk{Lqf) and the times qd are resolutions a = qd 
at which a feature is removed from the group of fc-homological 
features HkiLa) = 

The persistence diagram for the filtration in Figure |5}(top) 
is shown in Figure |5}(bottom). In this diagram the horizontal 
axis denotes the birth time of 1-homological features (when 
holes appear) and the vertical axis represents the death time 
of 1-homological features (when holes are filled). At time 
a = 0, there are no 1-simplices and consequently no 1-holes. 
The first 1-cycle appears at time a = 0.5 when the 1-chain 
[a, b] -f [6, c] -f [c, d] -f [d, a] appears. We mark this event by 
the addition of a vertical line in the persistence diagram. A 
second homological feature appears in the form of the cycle 
[a, 6] -f [b,d] -f [d,a] at time a = 0.7. This event is marked 
by the addition of a second vertical line in the persistence 
diagram. This homological feature dies at time a = 0.8 
when the 2-simplex [a,b,d] appears and makes the 1-cycle 
[a, b] -f [6, d] [d, a] a 1-boundary and therefore no longer a 
homological feature. This is marked by the addition of the 
point qi = (0.7,0.8) to the persistence diagram. At this time 
all simplices have been added to the filtration. This means 
that the homological feature [a, b] -f [&, c] -f [c, d] -f [d, a] never 
becomes a boundary of a 2-simplex. This is marked by adding 
the point q2 = (0.5,1) to the persistence diagram - recall that 
dissimilarities must be between 0 and 1 by definition. 

A few more examples are shown in Figure |6] Network (a) 
contains one feature that appears at a = 0.5 and stays alive 
until a = 1. Network (b) also has one feature born at a = 0.5 
that never gets trivialized until a = 1. Network (c) involves 
two features, born at a = 0.5 and a = 0.7 that also stay alive 
until a = 1. Network (d) is the same as the filtration in Figure 
|5}(top). 

We close this section by noting all dissimilarity values be¬ 
tween tuples with non-repeating elements of any dissimilarity 
network appear in the homological features of the induced 
filtration. 
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Proposition 2 Given a dissimilarity network D^, any of its 
k-order dissimilarities between tuples with unique elements 
appear either in the death time of the {k — l)-th dimensional 
homological features or the birth time of the k-th dimensional 
features. 

Proof: See Appendix lAl □ 

Proposition |2] is specific to filtrations induced from dis¬ 
similarity networks since it follows from the fact that adding 
elements to a tuple results in strictly increasing dissimilarities 
[cf. Definition m . The proposition implies that different net¬ 
works result in different persistence diagrams except in the rare 
cases when a single point in the diagram represents multiple 
homological features. Thus, persistence diagrams retain almost 
all of the dissimilarity values of a given network and are 
therefore not unreasonable proxies for network discrimination. 
We cement this intuitive observation in the next section by 
proving that differences between persistence diagrams yield 
lower bounds on the network distances defined in Section |II] 
We do so after an important remark. 

Remark 1 The maximum death time of any homological 
features in filtrations induced from dissimilarity networks is 
1. This is specific for filtrations constructed from dissimilarity 
networks because any undefined hyperedges in dissimilarity 
networks all take value 1 implicitly. In computing the persis¬ 
tence diagram, we can either (i) set the appearance time of all 
undefined hyperedges as 1 before evaluating the persistence 
diagrams, or (ii) determine the persistence diagram with no 
additional care of undefined edges, and then set the death 
time of any undead features to 1. These two procedures yield 
the same persistence diagrams. We use the latter method in 
practice because it is computational simpler. 

IV. Persistence Bounds on Network Distances 

In this section we use differences between persistence dia¬ 
grams to compute lower bounds of network distances. Recall 
that a persistence diagram VkC{Dx) is a collection of points 
of the form q = [qb,qd] where qb and q^ > qb represent the 
birth and death time of a fc-homological feature [cf. Figure |6l. 
To compare persistence diagrams 'PkJC-{D^) and TkC-i^Dy) 
of networks and Dy we begin by defining the cost of 
matching features q G Q and q G Q through the infinity 
norm of their difference, 

l|q- q||oo := max [\qb - qb\, \qd - qd\]- (H) 

Further observe that the diagonal of a persistence diagram can 
be construed to represent an uncountable number of features 
with equal birth and death times. Thus, any feature q G Q 
can be compared to any of the artificial features of the form 
q = [q,q]. Since this feature q can be placed anywhere in 
the diagonal of the persistence diagram, we choose to place it 
in the point that makes the infinity norm difference smallest. 
This point is q = [{qb + qd)/^, {qb + gd)/2] which yields 
the difference ||q — q||oo = \qd — 9f)|/2. Likewise, artificial 
features q = [(gf, -|- qd)/‘^, {qb + 9d)/2] can be added for any 
point q G Q. We can then rephrase the comparison in (1111 1 


so that if it is more advantageous to compare with artificial 
diagonal features. This is formally accomplished by defining 
the matching cost c(q, q) between q and q as 


c(q,q):=min ||q- q||^, (l/2)max[|gd - qb\, \qd - qb\] 


( 12 ) 


The norm ||q — qll^^ is the cost of directly matching features 
q and q. The term (1/2) max [ \qd — qb\ , \qd — qb\ is the cost 
of matching both, q and q to artificial diagonal features. The 
cost c(q, q) of matching q and q is the smaller of these two. 

If the respective collections of features Q and Q contain the 
same number of elements m = m we can consider bijections 
TT : Q —!• Q from Q to Q. The bottleneck distance between the 
persistence diagrams Vkl~-{D^) and Vkdl{DY ) of networks 
and Dy is then defined as 

b^{Dx,Dy)-.= min maxc(q, 7r(q)). (13) 

qeC 

The distance in ([T3l l is set to the pair of features q and q that 
are most difficult to match across all possible bijections tt. 

When the number of points in the persistence diagrams are 
different we assume without loss of generality that rh < m. 
In this case we extend Q by adding rh — m diagonal features 
to define the set Qe := Q U {(%,Since the sets 
Q and Qe contain the same number of elements bijections 
TT : Q —> Q are well defined. We can then modify (lOT l to 
define a valid comparison between VkC. and VkJO- when the 
number of points in the diagrams are possibly different. We 
do so in the following formal definition. 


Definition 5 Given persistence diagrams Vkd^{Dx) and 
VkC,{DY ) with sets of points Q and Q having cardinalities 
rh < m, define the extended set Qe '■= QG {(qu ft)}//!™ 
by adding rh — m artificial diagonal features. The bottleneck 
distance between the persistence diagrams VkiO,{Dx) and 
Vk^^{DY ) of networks and Dy is defined as 

h^{Dx,Dy)-.= min maxc(q, 7r(q)), (14) 

l^-.Q^Qe qe® 

where tt ranges over all bijections from Q to Qe and the cost 
c(q, 7r(q)) is defined in (I121 l. 

The number of bijections between two sets of points is 
factorial and it appears that the problem as in dTHl is as difficult 
as the problem of finding the correspondence in evaluating 
the network distance. However, the problem in (O is a 
instantiation of the Linear Bottleneck Assignment Problem 
(LBAP) that can be solved efficiently - see Section rvlland 
ll29l Algorithm 6.1]. We emphasize that c(q, q) = c(q, q') 
for any q whenever q and q' are on the diagonal. Therefore, 
the locations of diagonal points added to construct Qe are 
unsubstantial as per their cost. 

We prove now that the bottleneck distance between the 
persistence diagrams of the filtrations induced by two dis¬ 
similarity networks is a lower bound of their dissimilarity 
network distance. 
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Fig. 7. An example to construct augmented networks. and Dy are 1- 
order dissimilarity networks. The augmented network using correspondence 
C induces = D^. An additional node 02 and two edges are augmented 
in Ay. 

Theorem 1 Let and Dy be two K-order dissimilarity 
networks. The bottleneck distance between the k-th dimen¬ 
sional persistence diagrams of the filtrations C{D^) and 
C{Dy ) is at most rfr),oo(-D^, )/or any 0 < k < K, i.e. 

b\D^,DP)<dT,^^{D§,D^). (15) 

Proof : In order to prove Theorem [T] we introduce the fol¬ 
lowing notion of augmented networks. This dehnition solves 
the issue when a single node in one network has multiple 
correspondents in the other network. In such cases, say, if a 
in X, has both c and d in F as correspondents, we examine 
the difference between r^{a, a) and r^(c, d) in evaluating the 
network difference. However a) = rx(a) represents the 
birth time of [a] while ry (c, d) denotes the time of appearance 
of an edge [c, d]. The augmented networks are introduced to 
resolve this discrepancy. 

Definition 6 Given two K -order dissimilarity networks 
and Dy and a correspondence C between their node sets X 
and Y, the augmented networks q and Ay q are a pair 
of K-order networks defined on C. Each node Oi in C 
and AyQ represents a correspondent pair {xc'i,yCi) ifi C. 
Relationship functions for a^-.k with 0 < k < K are defined 
as 

?’Ax(“0:fc) = r\{xCy.Ck)^ r\yiao:k) = Ty(yCo:CJ- ( 16 ) 

When the underlying correspondence is clear, C is omitted 
in the subscripts. For a pair of dissimilarity networks 
and Dy and a correspondence C between X and Y, the 
augmented networks A^ and Ay have identical |C| nodes 
where \C\ denotes the number of correspondence pairs in C. 
Each node in both A^ and Ay represents a correspondent 
pair (xCi,yCi) in C. For each tuple ao:fe, its relationship 
r\x {O'O-.k) for the network A^ is the same as the relationship 
rxixco-Ck) between the tuple xco-.Ck in dD^- An example 
to construct augmented networks is illustrated in Figure |7] 
Augmented networks satisfy the following properties. 

Fact 1 Given a K-order dissimilarity network and a 
correspondence C between the node sets X and Y, the 
augmented network A^ constructed by Definition |6| is a valid 
K-order network. The induced filtration C{Ax) is a valid 
filtration and has identical persistence diagrams as C{Dx), 
i.e. for any 0 < k < K, 

VkC{A^)=rkC{D^). (17) 


Proof: See Appendix iBl □ 

Back to the proof of Theorem [T] Let d = di5,oo(77^, Dy). 
From Definitions |2 there exists a correspondence C between 
X and Y such that |r^(a;o:fc)—ry (yoife)! ^ ^ for any k and any 
pairs of correspondents {xo-.k, yo-.k) = {xo,yo), ■ ■ ■, {xk,yk) € 
C. Construct the pair of augmented networks A^ and Ay 
defined in Definition |6] using this correspondence. It follows 
from dmi that given a tuple ooifc € C with each node Oi 
representing the correspondent pair {xci, yCi) in C', 

kL(ao:fc)-rl^(ao:fc)| = |r|(xco:cJ-ry(2/Co:cJ| <^- 

( 18 ) 

Since A^ and Ay have identical nodes, (fTsT i implies that any 
simplices that appear at time a in the induced filtration C{A^) 
will appear no earlier than a — 5 and no later than a -I- d in 
the induced filtration C{Ay). Using the interleaved theorem 
ll^ Prop. 4.2], this yields the bound 

b'^{A§,A^)<S = dvMD^^DP). ( 19 ) 

From Fact[T] b^{Ax,Ay) = b’^{Dx, Dy). Substituting this 
into (fT^ concludes the proof. □ 

When comparing high order networks via their induced 
persistence diagrams, Theorem[T|provides justification that the 
dissimilarity obtained from persistent homologies is a lower 
bound on their infinity norm network distance. In particular. 
Theorem [T] implies that; (i) A large difference in persistent 
homologies imply the networks to be highly different, (ii) 
Lower bounds can be used to estimate distance intervals 
because upper bounds are easy to determine using specific 
correspondences as per Definition |2] 

We note that the differences between persistence diagrams 
do capture important differences between networks. E.g., in 
Figure |6] we can consider network (b) as a modification of 
network (a) in which we separate node a into the closely 
related nodes a and b that have a dissimilarity rx{a, b) = 0.1. 
The persistence diagrams of these two networks are identical, 
which is consistent with the relative proximity of these two 
networks. Network (c) is more different, because the addition 
of the dissimilarity r\{b,d) =0.7 complicates the argument 
that nodes a and b can be well represented by a single node 
as in network (a). Network (d) can be argued to be in between 
networks (b) and (c) since the dissimilarity rx{a,b,d) = 0.8 
implies a sense of added proximity between nodes a, b, and d. 
The difference between the persistence diagrams of networks 
(c) and (b) is indeed larger than the difference between the 
diagrams of networks (d) and (b). 

Further evidence for the usefulness of the bound in (fTsT i 
follows from the fact that the bound is tight in some cases. 
An example of a tight bound is shown in Figure The 
optimal correspondence C, shown in the figure, yields a 
network distance dx) ^oo{D\, Dy) = 0.1. The coordinates 
of the points in the 0-dimension persistence diagrams for 
C{D\) are (0, cx)), (0.12,0.42), (0.2,0.32) and for C{D^) 
are (0.1, oo), (0.21, 0.51), (0.25, 0.39). The coordinates of the 
point in the 1-dimension persistence diagrams for C{Dx) are 
(0.6, oo) and for C{Dy) is (0.5, oo). The bottleneck distances 









Fig. 8. An example where the bottleneck distance between the 
fc-th dimensional persistence diagrams of the filtrations C(D^) and 
C(Dy) is the same as their oo-norm network distance for k G 
{0,1}. The optimal correspondence C yields d-jy ^ao{D^, Dy) = 0.1. 
The points in the 0-dimension persistence diagrams for C(D^) are 
with coordinates (0, oo), (0.12, 0.42), (0.2, 0.32) and for Zi(Dy) are 
(0.1, oo), (0.21, 0.51), (0.25, 0.39). The point in the 1-dimension persis¬ 
tence diagrams for C{D^) is with coordinate (0.6, oo) and for C{Dy) 
is (0.5, oo). The bottleneck distances between the 0- as well as the 1- 
dimensional persistence diagrams of the filtrations induced from the two 
networks are 0.1. 


between the 0- and 1-dimensional persistence diagrams of the 
induced hltrations are 0.1; same as dT>,oo{Dx, Dy)- 

A. Persistence Bounds on k-order Distances 

Theorem [T] guarantees that the cxD-norm dissimilarity net¬ 
work distance d-D^oo can be tightly bounded by persistence 
diagrams. In this section we build on Theorem [T] to show 
that the fc-order network distance d^ can also be bounded by 
persistent methods. 

Theorem 2 Given two dissimilarity networks and Dy 
and an integer 1 < k < K, the bottleneck distance between 
the k'-th dimensional persistence diagrams of the filtrations 
C{Dx) and C(Dy) is at most d!^{Dx, Dy) for any 0 < 
k' < k, i.e. 

b'^'{D^,D^)<dUD^,D^). (20) 

Proof: We prove Theorem from Theorem [1] To do that, we 
leverage the relationship for network distances as we present 
next. 


Fact 2 Given high order networks Np and Np, 

we have 

dp{Np,NP)<dp{Np,NP), 

(21) 

for 0<k'<k — 1<K, and 


dM,oo{NP,NP) = dP{NP,NP). 

(22) 

Proof: See Appendix IbI 

□ 

Fact 12] implies that dp increases as the order fc becomes 


higher, and the oo-norm distance d^^oo is the same as the 
iT-order network distance. The result in Theorem |2] only 
considers relationship functions of order k, and to leverage 
the connection as in Fact |2] we introduce the following notion 
of truncated networks. 

Definition 7 Given K-order network = 

(^> ■ 1 ''’x)’ its k-order truncated network Ny is 

defined as = [X, ,..., r^). 

A fc-order truncated network has the same node set as 
its parent network Ny and collects the lowest fc -b 1 order 
relationship functions of ■ Back to the proof of Theorem 
|2] construct the fc-order truncated networks and Dy from 


Dx and Dy. It follows directly from Theorem [T] that for any 

0 < fc' < fc - 1, 

b^\D^,D§) < dv^UDx^D'f). (23) 

Since Dx and Dy are valid high order networks. Fact |2] 
implies that d-p^ooiOxT Dy) = dlf^DxjDy). Meanwhile, 
d^{Dx,Dy) = d^{Dx,DY) follows from the facts that 
and have identical fc-order relationship functions and 
Dy and Dy have same fc-order relationship. Finally, for any 
fc' < fc — 1, the fc'-th dimensional persistence diagram of 
C{Dx) is identical to that of C{Dx) and the fc'-th dimensional 
persistence diagram of C{Dy) is identical to that of C{Dy ), 
therefore b^' {D’f^Dy) = b^' {Dx^Dy). Combining these 
observations, the proof concludes. □ 

Theorem |2] bounds the fc-order network distance by the 
bottleneck distance between persistence diagrams of any order 
fc' < fc. We note that best lower bound is not necessarily 
fc' = fc—1. Observe that the result in Theorem|2]does not apply 
for fc = 0. This is not a problem because the 0-order network 
distances d^ examine relationships between individual nodes 
only. Thus, we can compute the optimal correspondence in 
Definition |2]by matching each of the relationships (x) in the 
network to the closest relationship ry {y) in the network 
Dx- We emphasize that the lower bound described in Theorem 
in is also tight since we can find dissimilarity networks Dx 
and Dy such that d^{Dx, Dy ) equals the bottleneck distance 
between the fc'-th dimensional persistence diagrams of the 
filtration C{D\) and C{Dy) for any G < k' < k — 1 < K. See 
Figure |8]for an illustration where bottleneck distances between 
the fc-dimensional persistence diagrams coincide with 

One of the key arguments in the proof of Theorem |2] is the 
fact that the infinity norm network distance dx'^ooiXx , XP) 
and the iT-order network distance dP{Np, Np) coincide [cf. 
Fact 121. This equality implies that Theorem [T] follows as a 
particular case of Theorem |2] by setting k — K in (EOjl 
and observing that dj^^oo{Np,Np) = dp-{Np, Np). Do 
notice that this doesn’t mean that the proof of Theorem [T| 
is redundant because Theorem [T] is leveraged in the proof of 
Theorem |2] However, the result does imply that the infinity 
norm distance d_x^ooiNp, NP) does not contain information 
beside the one that is contained in the collection of fc-order 
distance dp{NP, NP ) for k = 0,..., K. 

Another consequence of Theorem |2] is that we can bound 
the p-norm network distance dT>,p{Dp, Dp) [cf. DefinitionO 
for arbitrary p. This can be done using the individual bounds 
for dp{DP, Dp) established in (l20l i as we show next. 

Corollary 1 Group the k-order dissimilarity network dis¬ 
tances in the vector dP{Dp,Dp) := Dy), 

dplDP, Dp), ..., dP{DP, DP)\^. Further define the vec¬ 
tor h^{Dp,Dp := [dl{Dp,DP), b^pDp,DP), ..., 
b^^{Dp,DP)^ whose first component is the zero order 
distance d^{DP, Dp) and whose other components are bot¬ 
tleneck lower bounds with ki < 1. The p-nonn network 
distance can be lower bounded as 

\\h^{Dp,DP)\l<\\dP{Dp,DP)\l<dvADP,DP). 

( 24 ) 
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Proof: The second inequality follows from the fact that a single 
correspondence is used in dx>^p whereas an order-specific cor¬ 
respondence may be utilized in each /c-order distance 
For the first inequality, b^‘ Dy ) < Dy ) comes 

from Theorem (|2]). This implies that the vector {D^, Dy) 
is element-wise smaller or equal to the vector d^{D^, Dy). 
□ 

Corollary [T] shows that the p-norm network distance d-p^p 
can be lower bounded using: (i) The exact value of the 0-order 
network distance, (ii) The persistence homology lower bounds 
in (I 20 I) of Theorem |2] The bounds in Corollary [T] are expected 
to be the most useful when they combine information gleaned 
from persistence diagrams of different orders. 

Remark 2 The requirement of having the inequality in (|6]l 
be strict unless all the elements of xo-,k appear in a;o:fc-i 
for the network to be a dissimilarity network [cf. Definition 
11 is not always naturally satisfied in practice. As we have 
seen, its satisfaction may necessitate the addition of a small 
but arbitrary constant to some tuple relationships [cf. Figure 
[11. This is not a problem in practice because Theorems [T] 
and [ 2 ] still holds if this technical condition is violated. The 
only modification in the statements is that the respective 
network distances are not metrics but pseudometrics. What this 
means is that it is possible to construct pathological examples 
of networks that satisfy the order decreasing property not 
strictly and that have null network distance while not being 
isomorphic. These networks are unlikely to appear in practice 
and, even if they do, the bounds in Theorems [T] and [2] hold, 
albeit with a different interpretation. 

V. Proximity Networks 

In dissimilarity networks relationships r\{x[).k) encode a 
level of dissimilarity between elements of the tuple. The 
relationship r\{xQ-k) may, alternatively, describe similarity or 
proximity between elements. To account for this, we define 
proximity networks as those in which the relationship in (|6]l 
is flipped to 

rx{xo-.k) < r^"^(a;o:fc-i), (25) 

with the inequality equalized if and only if the element Xk 
also appear in x^-k-i- We refer to (1251) as the order decreasing 
property and denote the set of all proximity networks of order 
K as ■ When the input networks of Definition [2] or [3 are 
proximity networks, we refer to the network distances as the Re¬ 
order proximity network distance dp or the p-norm proximity 
network distances d-p^p. The restrictions to proximities make 
the distances well-defined metrics in the respective spaces 
1221 . 

As is the case with the order increasing property in dis¬ 
similarity networks, the order decreasing property is well jus¬ 
tified in proximity networks. E.g., consider a different aspect 
of coauthorship in which the fc-order relationship function 
between authors is proportional to the number of publications 
coauthored by all members of a given (fc -f 1)-tuple. In 
particular, the zeroth order proximities are the numbers of 
papers published by authors normalized by the total number 



order relationship in this 2-order network describes the number of publications 
between members of a (fc + 1)-tuple normalized by the total number of 
papers. E.g., A writes 11 papers in total, and coauthors 4, 2, and 2 papers 
respectively with B, D, and C. The e in the relationship function is a technical 
modification to differentiate relationship between two authors from that of a 
single author. Author A also writes 1 paper jointly with B and C. 

of papers. In Figure 0 authors A, B, C, D publish 11, 9, 2, and 
5 papers respectively and there are 19 papers in total which 
implies r°x{A) = 11/19, r\{B) = 9/19, r\{C) = 2/19, 
r\{D) = 5/19. The first order proximities r\ represent the 
number of papers co-published by nodes. Since a collaboration 
for a pair of authors is also a paper for each of the individuals 
it is certain that r^ix^x') < r\{x). In Figure |9l A and 
B collaborate on 4 papers, which is less than the 11 and 9 
papers written by each of the individuals. Authors A and C 
as well as A and D coauthor 2 papers in total. Second order 
proximities for triplets indicate the normalized number 
of papers coauthored by the three members. Since a paper 
with three authors is also a collaboration for the three pairs of 
authors we must have r\{x,x',x") < r\^{x,x'). In Figure |9] 
authors A, B, and D cowrite 2 papers, which is no more than 
the number of pairwise collaborations between each pair of the 
authors. An e constant is subtracted from some proximities to 
satisfy the technical condition that r^{xo:k) = r^^{xo-.k-i) 
if and only the elements in xqx are all elements of a:o:fc-i. This 
is not a concern in practice either - see Remark [2] Undefined 
hyperedge relationships in proximity networks take the value 
0 which is the minimum possible proximity for a tuple. 

For any proximity network with relationships Px{xo:k) 
we can define the dual network Dy with relationship functions 
dx{xQ.,k) '■= 1 — Pxi^o-.k)- It is not difficult to see that Dy is 
a well defined dissimilarity network and any of the network 
distances between the dual dissimilarity networks coincides 
with the respective distance between the proximity networks 
II 22 I . It follows that the bounds in Theorems (Tj and [2] apply 
for proximity networks with the caveat that the flltrations 
VkJC-{Dy) and VkC-{Dy) are flltrations of the dual networks 
associated with the given proximity networks Py and P^. 

VI. Implementation Details 

We discuss implementation issues regarding the procedure 
for evaluating lower bounds, its computational cost, and some 
heuristic simplifications. 

Procedure. To compute distance lower bounds for a dis¬ 
similarity network we construct flltrations and evaluate persis¬ 
tence diagrams. Algorithmic procedures to compute flltrations 
and persistence diagrams are discussed in El, jn]. The 
bottleneck distances {Dy , Dy) between the persistence dia¬ 
grams of different orders 0 < k < K are then evaluated. These 
distances follow from Definition H and are found as solutions 
of LEAP with costs given by (ITSl i. If the interest is in the 
infinity norm network distances dp^oo, any of the bottlenecks 
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distances is a lower bound [cf. Theorem[T]. If we are interested 
in the fc-order network distance d^, the bottleneck distance 
between persistence diagrams of dimension k' < A: — 1 is 
a lower bound [cf. Theorem IJJ. When given a proximity 
network we construct the dual network with relationships 
Txi^o-.k) = 1 — i"x{xo:k) and proceed as before. 

Computational cost. The process above involves computation 
of the persistent homology and evaluation of the bottleneck 
distance. Given a iT-order network with n nodes, denote as n 
the total number of simplices and m as the number of points 
in the persistence diagram. The complexity of computing a 
persistence diagram is akin to matrix multiplication on the 
number of simplices. This yields a computational complexity 
of order Il24l . OTI . The complexity of evaluating 

the bottleneck distance using LEAP [cf. (O] is of order 
logm) li^ Algorithm 6.1]. The number of simplices 
of order k can be as large and the number of points in 
a persistence diagram as large as the number of simplices. 
This means that we can have m = n = , which yields 

impractical computational times for orders larger than K = 2. 
In the practical examples we consider in Sections IVII-BI 
and IVII-CI the number of simplices h is much smaller than 
the maximum possible and scales linearly with the number 
of nodes. This yields complexity for computing 

the persistence diagram and 0 {n?'^ logn) for the bottleneck 
distance. 

Undefined tuple relationships. Formally, dissimilarity networks 
require that all undefined tuple relationships take value 1. As 
defined above this is impractical because it would scale the 
total number of simplices as fi = n*. In computations we 
leave these simplices undefined and let features die at infinity. 
As it follows from Remark [T] the death time of these features 
can be set to a = 1 to yield the same result that would be 
obtained from setting undefined tuple relationships to 1 before 
computing the persistence diagram. 

Elimination of small homological features. We can reduce the 
number of points in a persistence diagram by removing points 
q close to the diagonal. This is justified because: (i) These 
points represent ephemeral homological features and are likely 
to be generated by noise in observations, (ii) They contribute 
a small value to the cost in dEll and won’t improve the 
lower bound much from the trivial nonnegative lower bounds 
0 < d-D .oa{Dx T Dy) or 0 < d'f,{D^, Dy) that hold for any 
distance. 

VII. Applications 

We illustrate the usefulness of homology methods through 
experiments in both synthetic (Section IVII-Ab and real-world 
data (Sections IVII-BI and IVII-Cl l. The objective is to demon¬ 
strate that networks with similar structures and should be alike 
to each other are indeed similar in their persistent homologies. 

A. Classification of Synthetic Networks 

We consider three types of synthetic weighted pairwise 
networks. Edge weights in all three types of networks encode 
proximities. The first type of networks are with weighted 


Erdos-Renyi model ll3^ . where the edge weight between any 
pair of nodes is a random number uniformly selected from the 
unit interval [0,1]. In the second type of networks, the coor¬ 
dinates of the vertices are generated uniformly and randomly 
in the unit square, and the edge weights are evaluated with 
the Gaussian radial basis function exp(— ci(i, j)^/2cr^) where 
d{i,j) is the distance between vertices i and j in the unit circle 
and and cr is a kernel width parameter. In all simulations, we 
set tr to 0.5. The edge weight measures the proximity between 
the pair of vertices and takes value in the unit interval. In 
the third type of networks, we consider that each vertex i 
represents an underlying feature G of dimension d, 
and examine the Pearson’s linear correlation coefficient pij 
between the corresponding features and Uj for a given pair 
of nodes i and j. The weight for the edge connecting the pair 
is then set as pij/2 -b 0.5, a proximity measure in the unit 
interval. The feature space dimension d is set as 30 in all 
simulations. 

We start with networks of equal size |A| = 30 and 
construct 50 random networks for each aforementioned type. 
The edges with weights no greater than a threshold r are 
removed to create sparsity. We set r = 0.2 in all simulations. 
In order to transform the constructed pairwise networks into 
high order proximity networks, the 0-order proximity for 
any node is set to 0, i.e. r\{x) := 0 for any x G X. 
We then use the persistent homology method described in 
Section|VT]to evaluate the dissimilarities between all networks. 
The persistent homologies are computed using JavaPlex 
Eigure [TO] (a) and (b) plot the two dimensional Euclidean 
embeddings li^ of the network metric lower bounds 6° and 
between the 0- and 1-dimensional persistence diagrams 
respectively. Networks constructed with different models form 
clear separate clusters with respect to where networks 
with Erdos-Renyi model are denoted by red circles, networks 
with unit circle model are described by blue diamonds, and 
correlation model represented as black squares. The clustering 
structure is not that clear in terms of but the dissimilarities 
between networks constructed from different models are in 
general much higher than that between networks from the 
same model, and an unsupervised classification with one linear 
boundary in the embedded space would yield 10 out of 150 
eiTors (6.7%). 

Next we consider networks with number of nodes ranging 
between 30 and 39. Ten networks are randomly generated 
for each network type and each number of nodes, resulting 
in 150 networks in total. Eigure [TO] (c) and (d) illustrate 
the two dimensional Euclidean embeddings of the network 
metric lower bounds 6° and b^ for these networks. Despite the 
fact that networks with same model have different number 
of nodes, dissimilarities between persistent homologies are 
smaller when their underlying networks are from the same 
process. Besides, networks with the correlation model are 
highly similar to each other regarding their corresponding 
persistent homologies, irrespective of the sizes of the networks 
nor the dimension of persistence diagrams. An unsupervised 
classification with one linear boundary would yield 6 out 
of 150 errors (4.0%) for 6° and 7 errors (4.7%) for b^. 
The performance of 6° stays relatively unchanged when the 
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(a) , 30 nodes (b) , 30 nodes (c) b^^, 30 to 39 nodes (d) b^, 30 to 39 nodes 

Fig. 10. Two dimensional Euclidean embeddings of the networks constructed from three different models with different number of nodes with respect to the 
network metric lower bounds and b^. In the embeddings, red circles denote networks constructed from the Erdos-Renyi model, blue diamonds represent 


networks constructed from the unit circle model, and black squares the networks from the correlation model. 



(a) death time in 'PqC, ascending order (b) death time in VoC, descending order (c) birth time in 'PiC, ascending order (d) birth time in 'PxC, descending order 

Fig. 11. Illustrations of the set of death time in 0-dimensional persistence diagrams VoC and the set of birth time in 1-dimensional persistence diagrams 
ViC for the filtrations constructed from the randomly generated networks with number of nodes ranging between 30 and 39. The top 50 networks represent 
those constructed from the Erdos-Renyi model, the middle 50 networks are with the unit circle model, and the bottom 50 are with the correlation model. 


number of nodes in the networks considered reside in a 
range of nodes. The simulations illustrate the applicability of 
persistent homology in identifying the patterns of networks of 
different processes. Similar results are obtained when different 
parameters are used in generating the networks. 

Here we give interpretations of why persistent homologies 
succeed in network discrimination. Since only pairwise re¬ 
lationships are examined, the bottleneck distance }p is de¬ 
termined by the death time of the 0-dimensional persistent 
homologies. If we consider that nodes connected by an edge 
of weight w become members of the same community at 
time w, the death time in VqC can be interpreted as the 
time instant when isolated nodes join the main community 
of the network. We focus our attention on the 150 networks 
with size ranging between 30 and 39. Each row in Figure [TT] 
(a) represents a network, and plots the death time in the 0- 
dimensional persistence diagram VqC of the filtration induced 
from that network in ascending order from left to right. The 
top 50 networks represent those with Erdos-Renyi model, the 
middle 50 are with the unit circle model, and the bottom 50 
are with the correlation model. Each row in Figure [TT] (b) 
plots the death time in VqC, in descending order from left 
to right, of the induced filtration. It is not a direct mirror 
of Figure [TT] (a) because the size of the networks ranges 
from 30 to 39 and so the number of points in the persistence 
diagrams are different. The time instants when isolated nodes 
join the main community in the networks with the correlation 
model are concentrated in the interval [0.25,0.4]. This is due 
to the fact that the linear correlation coefficient between two 
randomly generated feature vectors cannot be too positive 
nor too negative. Also, networks with unit circle model are 
different from those constructed with the Erdos-Renyi model 
because in the latter case, the distribution of death time of 
points in VqC has heavy tail towards 1, which results from 


the fact that some points in the unit circle may be far away 
from the main component and it requires larger distance for 
them to join the main component. 

For higher order persistent homologies, iC is only affected 
by the birth time of the 1-dimensional persistent homologies, 
which can be interpreted as the maximum of the three pairwise 
connections between three nodes and therefore the time instant 
when a ‘closely-connected’ community is formed by the three 
nodes. Each row in Figure[TT](c) and (d) exhibits the birth time 
in ViC, in descending and ascending order from left to right 
respectively, of the filtration induced from the corresponding 
network. The time instants when three nodes in the networks 
with coiTelation model form a closely connected triplet are 
highly focused in the interval of [0.4, 0.6]. This is due to 
the fact that the when the coiTelation coefficient pij between 
features and Uj are small and pik between and Uk 
is small, pjk between Uj and cannot be too small. For 
networks with unit circle model, the time points for many 
nodes in the network forming a closely connected triplet are 
either smaller or larger compared to the Erdos-Renyi model. 
This is because if three points in the unit circle are close 
to each other, all the pairwise distances between them would 
not be high; otherwise the minimum inscribed circle of the 
three points would possess large radius, resulting in a high 
value on the maximum of the pairwise distances. Admittedly, 
other methods to compare networks may also succeed in 
distinguishing networks, after some proper treatment towards 
the issue of different sizes. Nonetheless, persistent homology 
method would be more universal, not only for the reason that 
it establishes a lower bound to the actual network metrics, 
but also since it provides a systematic way to analyze the 
formation of communities in a given network. 
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Fig. 12. Two dimensional Euclidean embeddings of the networks constructed from quinquennial publications in engineering and mathematics journals with 
respect to the network metric lower bounds b^{'PoC), h^{'P\C), and b^{'P 2 C). In the embeddings, red circles denote networks constructed from mathematics 
journals and blue diamonds represent networks from engineering journals. Networks constructed from publications of TSP are labeled. 
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Fig. 13. Two dimensional Euclidean embeddings of the networks constructed from annual publications in TAC, TSP, and TWC with respect to the summation 
of the metric lower bounds b^{'PoC), b^('PiC), and b^{'P 2 C,). In the embeddings, red circles represent TAC, blue diamonds TSP, and black squares TWC. 


B. Comparison of Coauthorship Networks 

We apply the lower bounds to compare 2-order coauthorship 
networks where relationship functions denote the number of 
publications of single authors, pairs of authors, and triplets. 
These coauthorship networks are proximity networks because 
they satisfy the order decreasing property. We consider pub¬ 
lications in 5 journals from mathematics community: Com¬ 
putational Geometry (CG), Discrete Computational Geometry 
(DCG), J. of Applied Probability, (JAP) J. of Mathematical 
Analysis and Applications (JMAA), SIAM J. on Numerical 
Analysis (SJNA), and 6 journals from engineering community, 
all from IEEE: Signal Processing Magazine (SPM), Trans. Au¬ 
tomatic Control (TAC), Trans. Pattern Analysis and Machine 
Intelligence (TPAMI), Trans. Information Theory, Trans. Sig¬ 
nal Processing (TSP), Trans. Wireless Communication (TWC). 
Eor each journal, we construct networks for the 2004-2008 and 
2009-2013 quinquennia. Eor TAC, TSP, and TWC, we also 
construct networks for each annual from 2004 to 2013. Lists 
of publications are queried from llT5l . 

Eor each of these journals we consider all publications in 
the period of interest and construct proximity networks where 
the node set X is formed by all authors of the publications. 
Zeroth order proximities are defined as the total number 
of publications of each member of the network, first order 
proximities as the number of papers coauthored by pairs, and 
second order proximities as the number of papers coauthored 
by triplets. To make networks with different numbers of 
papers comparable we normalize all relationships by the total 
number of papers in the network. The positive constant e as 
in Eigure |9] is set to 1/1000. There are papers with more 
than three coauthors but we don’t record proximities of order 
higher than 2. By assuming that networks from the same 
community or constructed from the same journal have similar 
collaboration patterns, we show here that network metric 


lower bounds succeed in identifying these patterns and in 
distinguishing coauthorship networks from communities with 
different interests. 

Eigure[T2]shows the two dimensional Euclidean embeddings 
of the network metric lower bounds 6°, and b^. The 12 
engineering networks (blue diamonds) separate clearly from 
the 10 mathematics networks (red circles) in b^ and b^. The 
clustering is not that clear in 6° but still networks from same 
community tend to be similar to each other. An unsupervised 
classification with one linear boundary running across the 
embeddings would generate errors of 2 (9.09%) to 5 (22.73%) 
out of 22 networks. Networks constructed from the same 
journal tend to be close in the lower bounds. As an example, 
the networks of TSP with different quinquennia are marked 
in the embeddings and it is clear that their differences in 
homologies are considerably low. Such scenarios are observed 
for several other journals as well. 

We analyze the persistent homology of each of the coauthor¬ 
ship networks to investigate the reason why persistent homolo¬ 
gies succeed in network discrimination. Compared to networks 
from mathematics communities, networks from engineering 
communities in general would yield 0-dimensional persistent 
homologies with smaller birth time but larger death time, 1- 
dimensional homologies with larger birth and death time, and 
2-dimensional homologies with larger birth time. An interpre¬ 
tation of such observations would be that in engineering, there 
exist more small communities that never collaborate with each 
other and it is uncommon to have a “club” of 3 to 5 authors 
in engineering that a strong collaboration exists between any 
pairs of the authors in the “club”; such scenarios are absent 
for mathematicians. 

As a comparison, we applied some simple and reasonable 
methods to compare the coauthorship networks considered in 
this section. Motifs have been shown effective in distinguish- 
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ing coauthorship networks from different scientific fields na. 
To compare high order coauthorship networks by motifs, we 
restrict attention to pairwise relationships. The dissimilarities 
between coauthorship networks are assigned as the differences 
between the summations of the weighted motifs in their 
corresponding pairwise networks. Analysis based on triangle 
motifs (weighted) results in a clear cluster between networks 
from CG, DCG, JAP and another cluster between networks 
from TSP, TWC, but cannot distinguish other networks very 
well. Tetrahedron motif analysis (weighted) results in three 
clear clusters; networks from CG, DCG, JAP, networks from 
JMAA, SJNA, SPM, TPAMI, and networks from TSP, TWC. 
Other simple and common methods to compare pairwise 
networks yield similar results. Methods to compare networks 
via features give us similar observations as those based on 
the persistent methods; feature methods would generate 6 to 
8 errors in classifications. 

C. Engineering communities with different research interests 

The network metric lower bounds succeed in distinguishing 
the different collaboration patterns in engineering and mathe¬ 
matics communities. We now illustrate that the lower bounds 
are also able to identify distinctive features of engineering 
communities with different research interests. To see this we 
consider the networks constructed from annual publications of 
TAC, TSP, and TWC. 

Figure [13] shows the two dimensional Euclidean embed¬ 
dings of the networks with respect to the summation of the 
lower bounds 5°, b^, and b^. We expect more variations in 
annual networks because the time for averaging behavior is 
reduced. Besides, it is hard to argue that intrinsic and obvious 
differences exist in the collaboration patterns in automatic 
control, signal processing, and wireless communication com¬ 
munities. Still, networks constructed from the same journal 
but different annuals tend to be close to each other and form 
clustering structures. An unsupervised classification with one 
linear boundary in the embeddings run across the summation 
of lower bounds would generate 4 (20%) errors out of 20 
networks in all three classification problems considered. The 
less obvious clustering structure formed by networks from 
different journals in Figure [T3] (c) compared to (a) and (b) 
also suggests that the collaboration patterns in research com¬ 
munities of signal processing and wireless communication are 
more similar compared to that of automatic control. 

VIII. Conclusion 

We establish connections between high order networks and 
simplicial complexes and use the differences between the in¬ 
duced homological features to evaluate the differences between 
networks. We justify that this is a lower bound to two families 
of valid metrics in the space of high order networks mod¬ 
ulo permutation isomorphisms. These lower bounds succeed 
in classifying weighted pairwise networks constructed from 
different processes, in distinguishing the collaboration patterns 
of engineering communities from mathematics communities, 
and in discriminating engineering communities with different 
research interests. 


Appendix A 

Proofs of Proposition|2] 

Given any tuple xo:fc with non-duplicating nodes, (doll 
indicates that the fc-simplex defined by the convex hull 
conv{a;o:fe} appears strictly after any of its faces conv{xo:j:fe} 
in the filtration. Suppose (j)^ appears at time a and denote 
dk4'^ = with the coefficients, then each 

appears strictly before time a. 

Now suppose that the appearance of (j)^ trivializes a (fc —1)- 
th dimensional homological feature. This means that (j)^ is the 
boundary to trivialize the (fc — l)-th dimensional cycle dk(t>^■ 
Since each face of appears strictly before time a, 

the cycle results in a homological feature. The death 

time of this homological feature is a, or equivalently, the time 
represented by the relationship r\(xQ-k)- 

On the other hand, if the appearance of (p^ does not trivialize 
a (fc — l)-th dimensional homological feature, then the (fc — 1)- 
cycle dk<p^ is in the collection of simplices appearing before 
or on time a. This means that dkp^ can be represented by a 
sum of the boundaries of some fc-chains <I>^, 

= (26) 

i 

with coefficients and fc-chains <I>f^ appearing before or on 
time a. By the definition of fc-chains, = Xj pijtpj with 
coefficients /3' and fc-simplices ip’p appears before or on time 
a. Therefore, (l26l l can be written as dkfp^ = Pjdk'Pj- 
Rearranging terms, 

=0- (27) 

3 

This implies that Xi Pi’’Pi — is a fc-cycle. There must be 
a new cycle formed since just appears. The cycle cannot 
be trivialized immediately since any {k + l)-chain with 

dk+i'^’‘~'~^ = Xi/3"V'i^ ~ would involve a simplex [xo-.k,i] 

for some node xi with tuple xo-,k,i consisted of non-repeating 
elements where this simplex [xo-,k,i] appears strictly after a. 
Therefore we have a fc-th dimensional homological feature 
with birth time a, or equivalently, the time denoted by the 
relationship r\{xw.k)- This concludes the proof. 

Appendix B 

Proofs of Fact [Hand Fact [2] 

Proof of Fact[TJ We need to prove the (i) symmetry property 
and (ii) identity property of high order networks, (iii) C{A^) 
being a valid filtration, and (iv) C{A^) and C{D^) having 
identical persistent homology. 

Proof of the symmetry property: For any tuples ao-.k, 
the symmetry property of dissimilarity network and 
the definition of augmented networks imply T^^(a[o:fe]) = 
Tx(a:[Co:Cfc]) = r\{xco-.Cu) = ^Ax(“ 0 :fc) for any reordering 
a[o:fc]- This shows the symmetry property of A^. □ 

Proof of the identity property: Given a tuple oo fc, if its 

subtuple a/o:i- have same set of unique elements as that of 
ttoife, according to the Definition of augmented networks, we 
would have that xco-.Ck xci -.Ci- also possess identical 
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set of unique elements. The identity property of dissimilarity 
network and the definition of augmented networks yield 

rAxi<^o-k) = r|(a:co:cJ = 

This shows the identity property of A^- □ 

Proof of £-{A^) being a valid filtration: The order increas¬ 
ing property of relationship function in augmented networks 
holds true due to the fact that 


rAiao-.k) = rxixCo:Ck)>rx \xCo:Ck-i)=rA Hao:k-l)- 

(28) 


The remaining proof is identical to the proof of Proposition 

[U □ 


Proof of C{Ax) and C{Dx) having identical persistence 
intervals: First, for each point x G X, pick one pair {x, y) 
from the correspondence C to construct Co that is a subset 
of C. If we define a map that maps each pair {x, y) to its 
first element x, this gives a bijective projection from Co to 
X. Construct A^^ x ™tl Aq x the augmented networks 
using the respective correspondence. It then follows naturally 
that Aq^ ^ is isomorphic to the the original dissimilarity 
network and so do the corresponding filtrations. Denote 

C := CiA^^x) and Co := CiA^^^x) = )■ 

Next, consider the projection of the filtration C onto Cq, 
where each vertex {x,y) in C is mapped to {x,yo) in Co 
who share the first element in the pair. Denote and Lq 
as the simplicial complexes that collect simplices appearing 
before or prior to a in the respective filtration. For each 
value of a, the projection defines a retraction of simplicial 
complexes Lq. Since the relationship functions are 

the same in the augmented network Aqx as in the original 
network Aq^ whenever a simplex a appears in L“, not 
only its projection appears in Lq at the same time, but all 
the simplices connecting vertices of a and vertices of its 
projection are also already in Lq. Hence, as a simplicial map, 
the projection L°‘ —> Lq is contiguous to the identity of L“. As 
a result, the section Lq —L“ is a homotopy equivalence. Its 
induced homomorphism at the homology level is therefore an 
isomorphism, and consequently the two persistence diagrams 
are isomorphic. □ 

Having demonstrated all statements, the proof completes. □ 


Proof of Fact |2j We need to prove the two statements in 
(ISTT l and (I 22 I 1 . Given any 0 < k' < k < K, it follows from 
dU that for any correspondence C between X and Y, 


^x,y(C') — max 

{xo-.k,yo-.k)£C 

"x,y{C) = , max 


rx{xo:k) - rY{yo-.k)\ , (29) 

rx{xo:k') - Ty {yo-.k') ■ (30) 


For the ^y^.f.,) G C achieving the maximum difference 
\xxixo-k')~XY iyo-k') \ ia Tx y(C'), we can construct another 
correspondent pair (a^Q.^^., t/p.^.) such that x^.f,, is a sub-tuple 
of XQ.f. with identical set of unique elements and y^.f., is 
a sub-tuple of with identical set of unique element. It 
follows from the identity property of high order networks 
that r’^xix*o,k') = x)c{xo,k) and r’({y*.j^,) = r^{y*o,k)- 
This implies that taking the maximum \r^{xo-k) — ry(yo:fc)| 


over {xo-.k,yo-.k) G C cannot yield a lower difference than 
kx«fe') i-e. 


^0:i 


rx.x(C) > Xxixo.^,) — ry iyo-k') 


= T 


k' 


{C). (31) 


Since (ISTT i holds true for any correspondence C G C{X,Y), 
the inequality must hold true when we take the minimum over 
all correspondences C{X, Y) between X and Y, 


min T\y{C)> min T%y(C'). (32) 

cgC(x,x) C'eC(x,x) 


Substituting the definition of fc-order and fc'-order network 
distances into (l32l l yields ^ concluding the proof of 

(EB. 

Also, it follows from © that for any correspondence C 
between the node sets X and Y, the network difference 
between Nx and Ny measured by C is 

||rf,y(C)||^= ^^max^^{r^,^(C')}. (33) 

From (IbTT i we know that for any fc', F^ yiC) > F^ yiC), and 
therefore for any correspondence C, max^Q {F^ = 

r^y(C'). Substituting this into (l3Tt and taking a minimum 
over all correspondences concludes the proof of (l22l i. 

Having proven the two statements, the proof completes. □ 
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